Large Populations Must be Genotyped to Detect Rare Alleles

[Plot; axis label: Allele frequency]
Mutation Database

[Plot; legend: HGMD data, simulation]
Distribution of Codon Mutation Classes in the CFTR Gene

[Plot; legend: CFTR mutation data, simulation]
FIGURE 3

At High Thresholds, SNIDE is Remarkably Accurate

FIGURE 4
Matrix construction

Gather a collection of variation data with the property of interest (e.g., disease-causing, high frequency, etc.)

Determine predictiveness scores from the relative frequencies of variations

Modify predictiveness scores based on other considerations (e.g., codon usage, structure, etc.)

Matrix deployment

Evaluate query sequence(s) against the predictiveness matrix to identify likely variants having the desired properties (e.g., disease-causing, frequency, etc.)
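
The construction and deployment steps in this flowchart can be illustrated with a minimal Python sketch. Everything specific in it is an assumption made for illustration and is not taken from the figure: the function names (`build_matrix`, `adjust_matrix`, `evaluate`), the representation of a variation as a `(ref_codon, alt_codon)` pair, the use of a log2 ratio against a background collection as the "predictiveness score", the pseudocount, and the additive form of the adjustment step.

```python
import math
from collections import Counter

BASES = "ACGT"

def build_matrix(property_variants, background_variants, pseudocount=1.0):
    """Score each (ref_codon, alt_codon) pair by the log2 ratio of its
    relative frequency in the property set to that in the background set.
    The log-ratio form and the pseudocount are illustrative assumptions."""
    prop = Counter(property_variants)
    back = Counter(background_variants)
    keys = set(prop) | set(back)
    prop_total = sum(prop.values()) + pseudocount * len(keys)
    back_total = sum(back.values()) + pseudocount * len(keys)
    return {
        k: math.log2(((prop[k] + pseudocount) / prop_total)
                     / ((back[k] + pseudocount) / back_total))
        for k in keys
    }

def adjust_matrix(matrix, adjustments):
    """Add a per-substitution adjustment (e.g. a codon-usage term).
    Additive adjustment is an assumption; pairs without one are unchanged."""
    return {k: s + adjustments.get(k, 0.0) for k, s in matrix.items()}

def single_base_neighbors(codon):
    """Yield every codon that differs from `codon` at exactly one base."""
    for i, base in enumerate(codon):
        for alt in BASES:
            if alt != base:
                yield codon[:i] + alt + codon[i + 1:]

def evaluate(sequence, matrix, threshold):
    """Return (codon_index, ref, alt, score) for each single-base variant
    of an in-frame query sequence that scores at or above the threshold,
    highest score first. Pairs absent from the matrix are skipped."""
    hits = []
    for pos in range(0, len(sequence) - 2, 3):
        ref = sequence[pos:pos + 3]
        for alt in single_base_neighbors(ref):
            score = matrix.get((ref, alt))
            if score is not None and score >= threshold:
                hits.append((pos // 3, ref, alt, score))
    return sorted(hits, key=lambda h: -h[3])

# Toy data, invented for the example only.
property_set = [("CGA", "TGA")] * 3 + [("GGA", "AGA")]
background_set = [("CGA", "TGA")] + [("GGA", "AGA")] * 3

matrix = adjust_matrix(build_matrix(property_set, background_set), {})
hits = evaluate("ATGCGAGGA", matrix, threshold=0.5)
```

With this toy data the substitution CGA to TGA is enriched in the property set and so passes the threshold, while GGA to AGA is depleted and does not. Raising `threshold` trades fewer reported variants for higher scores among those that remain.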



